Corpus-based empirical analysis of form, function and frequency of characters used in Bangla
نویسندگان
چکیده
In this paper an attempt is made to understand formal and functional aspects of Bangla characters used in the written texts compiled in a sample monitor corpus designed systematically from language data collected from various text documents published within 1980 and 1995. The purpose of this study is to understand the form and function of the characters, trace their behavioural peculiarities, and if possible, find out the reasons of such peculiarities. The study focuses on the formation of the characters, their structural change in case of compound and cluster formation, their contextual use, statistical analysis of their occurrence, and their position in words. The study also encompasses the use of different punctuation marks in the texts. Finally, some possible areas of application of such analysis are identified.
منابع مشابه
Genetic Diversity and Nutritional Components Evaluation of Bangladeshi Germplasms of Kidney Bean (Phaseolus vulgaris L.)
Considering the crucial focus on plant developments as high yielding, protein, and disease-resistant varieties, in this study, the genetic diversity and nutritional traits of available kidney bean germplasms found in Bangladesh have been evaluated based on seventeen quantitative and six nutritional traits. Analysis of genotypic, phenotypic variance and covariance showed that higher environmenta...
متن کاملSelection of Dwarf Stature Yield Potential Lines from F3 Populations of White Maize (Zea mays L.)
'Dwarf stature' maize variety offers promises to withstand unfavorable growth environments of Kharif season. But, for developing such variety, dwarf stature inbred lines must be available. Here, twenty-four F3 populations of white maize were evaluated though assessment of their genetic variability, heritability, and character association for selection of dwarf stature promising lines based on y...
متن کاملFormant Analysis of Bangla Vowel for Automatic Speech Recognition
To provide new technological benefits to the mass people, nowadays, regional and local language recognition draws attention to the researchers. Similarly to other languages, Bangla speech recognition scheme is demandable. A formant is considered as the resonance frequency of vocal tract. Formant frequencies play an important role for the purpose of automatic speech recognition, due to its noise...
متن کاملThe Vocabulary Profile of Iranian English Teaching School books
This paper provides a fairly detailed corpus-based vocabulary profile of the Iranian EFL books used in public schools. To this end, the WordPerfect files of all the seven books were converted to text format to get rid of the formatting features and be compatible with the software used for analysis. The software tools used were the Compleat Lexical Tutor suite, version 6.2 (Cobb, 2011), AntConc ...
متن کاملDeveloping a Corpus-Based Word List in Pharmacy Research Articles: A Focus on Academic Culture
The present corpus-based lexical study reports the development of a Pharmacy Academic Word List (PAWL); a list of the most frequent words from a corpus of 3,458,445 tokens made up of 800 most recent pharmacy texts including research articles, review articles, and short communications in four sub-disciplines of pharmacy. WordSmith (Scott, 2017) and AntWordProfiler (Anthony, 2014) were used to sc...
متن کامل